Sorting Forum Responses by Relevance to Original Post Aaron

نویسنده

  • Aaron Abajian
چکیده

Online forums provide a medium for discussing niche topics. Forums consist of threads, where each thread consists of an original post followed by response posts. A subset of original posts are questions related to the forum’s topic. The responses to these posts are often expert answers due to the niche nature of the forum. However, not all responses will be answers. Users are free to write anything in their response post. In this project, we train a Näıve Bayes classifier and a linear support vector machine (SVM) classifier to distinguish thread answers from nonanswers. Our features are preprocessed whitespace-delimited terms occurring in each response. We obtained training and testing data by implementing a web crawler and a web parser targeted against the online discussion forum Slickdeals.net/forums (1). We crawled T = 8 threads whose original posts were questions. There was an average of 94.25 responses for each post, giving a total of N = 754 responses for classification. We manually assigned a labeling of 0 or 1 to each response depending upon whether that response attempted to answer the original question. We performed leave-one-out crossvalidation on the eight test questions. The Näıve Bayes classifier correctly classified 70.1% (110/157) of the answers and 74% of the non-answers (442/597). The linear-kernel SVM correctly classified 80.2% of the answers (126/157) and 82.5% (493/597) of the non-answers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting Helpful Posts in Open-Ended Discussion Forums: A Neural Architecture

Users participate in online discussion forums to learn from others and share their knowledge with the community. They often start a thread with a question or sharing their new findings on a certain topic. We find that, unlike Community Question Answering, where questions are mostly factoid based, the threads in a forum are often open-ended (e.g., asking for recommendations from others) without ...

متن کامل

Review of “Computer-Assisted and Web-Based Innovations in Psychology, Special Education, and Health” edited by James K. Luiselli & Aaron J. Fischer

Computer-Assisted and Web-Based Innovations in Psychology, Special Education, and Health edited by James K. Luiselli & Aaron J. Fischer. London & San Diego: Academic Press, 2016. 408pp., $74.95 (hardcover), ISBN 9780128020753

متن کامل

The Criteria People Use in Relevance Decisions on Health Information: An Analysis of User Eye Movements When Browsing a Health Discussion Forum

BACKGROUND People are increasingly accessing health-related social media sites, such as health discussion forums, to post and read user-generated health information. It is important to know what criteria people use when deciding the relevance of information found on health social media websites, in different situations. OBJECTIVE The study attempted to identify the relevance criteria that peo...

متن کامل

The Impact of the Asynchronous Online Discussion Forum on the Iranian EFL Students’ Writing Ability and Attitudes

This paper focuses on the impact of an asynchronous online discussion forum on the development of students’ ability in and attitudes toward writing in English. To do this, 60 undergraduate students majoring in English were assigned to two experimental and control groups while receiving different types of feedback. Students in the experimental group were required to take part in an asynchronous ...

متن کامل

PAPER IN PROGRESS The Effect of Attention in the Representation of Relational Information

The reported study attempts to specify the role of attention in the representation of relational information while controlling for knowledge. In two domains, one knowledge-lean (propositional logic) and one knowledge-independent (object arrangements), a sorting task was used to attract attention either to relational features, to elementary features, or to irrelevant features. A subsequent categ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013